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Distributed  Real-Time  System  Design: 
Theoretical  Concepts  and  Applications 


Abstract:  Distributed  real-time  system  design  raises  new  theoretical  issues  and 
application  challenges,  beyond  those  of  centralized  systems.  Rate  monotonic 
scheduling  (RMS)  theory  has  been  successfully  applied  in  the  scheduling  of  cen¬ 
tralized  systems.  RMS  and  its  generalizations  have  been  adopted  by  national  high 
technology  projects  such  as  the  Space  Station  and  has  recently  been  supported 
by  major  open  standards  such  as  the  IEEE  Futurebus+  and  POSIX.4.  In  this 
paper,  we  describe  the  use  of  generalized  rate  monotonic  scheduling  theory  for 
the  design  and  analysis  of  a  distributed  real-time  system.  We  review  the  recent 
extensions  of  the  theory  to  distributed  system  scheduling,  examine  the  architec¬ 
tural  requirements  for  use  of  the  theory,  and  finally  provide  an  application  ex¬ 
ample. 


1 .  Introduction 

Real-time  computing  systems  are  critical  to  an  industrialized  nation’s  technological  in¬ 
frastructure.  Modem  telecommunication  systems,  factories,  defense  systems,  aircraft  and 
airports,  space  stations  and  high  energy  physics  experiments  cannot  operate  without  them. 
Indeed,  real-time  computing  systems  control  the  very  systems  that  keep  us  productive, 
safeguard  our  liberty,  and  enable  us  to  explore  new  frontiers  of  science  and  engineering. 

In  real-time  applications,  the  correctness  of  a  computation  depends  upon  not  only  its  results 
but  also  the  time  at  which  outputs  are  generated.  The  measures  of  merit  in  a  real-time  sys¬ 
tem  include: 

•  Predictably  fast  response  to  urgent  events. 

•  High  degree  ef  schedulability.  Schedulability  is  the  degree  cf  resource  utiliza¬ 
tion  at  or  below  which  the  timing  requirements  of  tasks  can  be  ensured.  It  can 
be  thought  as  a  measure  of  the  number  ofjtimely  transactions  per  second. 

•  Stability  under  transient  overload.  When  the  system  is  overloaded  by  events 
and  it  is  impossible  to  meet  all  the  deadlines,  we  must  still  guarantee  the  dead¬ 
lines  of  selected  critical  tasks. 

Real-Time  scheduling  is  a  vibrant  field.  Generalized  Rate  Monotonic  theory  (GMRS),  one  of 
several  important  research  efforts  ( [10],  [11]),  is  a  useful  tool  that  allows  system  developers 
to  meet  the  above  requirements  by  managing  system  concurrency  and  timing  constraints  at 
the  level  of  tasking  and  message  passing.  In  essence,  this  theory  ensures  that  all  tasks 
meet  their  deadlines  as  long  as  the  system  utilization  of  all  tasks  lies  below  a  certain  bound, 
and  appropriate  scheduling  algorithms  are  used.  This  puts  the  development  and  mainte¬ 
nance  of  real-time  systems  on  an  analytic,  engineering  basis,  making  these  systems  easier 
to  develop  and  maintain. 
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Application  of  this  theory  to  centralized  system  scheduling  is  well  known  [5].  The  focus  of 
this  paper  is  on  recent  generalization  of  this  theory  to  schedule  large  scale  distributed  real¬ 
time  systems.  In  addition,  we  provide  an  example  that  illustrates  some  of  the  practical  devel¬ 
opment  problems  in  the  application  of  this  approach  in  actual  system  development:  to  ad¬ 
dress  the  increasing  trend  towards  open  system  components  and  to  control  costs,  system 
designers  have  to  use  standardized  subsystem  components,  which  may  not  support  real¬ 
time  computing  needs.  To  reduce  the  number  of  processors  in  a  system,  sometimes  both 
real-time  and  non-real-time  applications  must  co-exist  in  a  processor  and  share  the  network 
with  real-time  traffic. 

The  following  is  a  high  level  view  of  our  example  application,  which  will  be  presented  and 
solved  in  detail  later  in  this  paper.  Since  a  widely  available  standard  network  that  supports 
GRMS  does  not  currently  exist,  we  build  our  example  system  around  the  ANSI  X3T9.5  FDDI 
network  as  shown  in  Figure  1-1.  Since  IEEE  Futurebus+  (896)  and  POSIX.4a  support  the 
use  of  GRMS  for  real-time  applications,  we  use  them  in  our  example  as  the  station  back¬ 
plane  and  operating  system  respectively.  From  the  application  view  point,  our  example  con¬ 
sists  of  both  classical  real-time  surveillance  and  control  applications,  and  multimedia  ap¬ 
plications. 


Remote  Audio/Video 

Sensor  Monitoring 


Figure  1-1 :  Block  Diagram  of  Distributed  Real-Time  System 

The  rest  of  the  paper  is  organized  as  follows.  Chapter  2  presents  a  synopsis  of  generalized 
rate  monotonic  theory  for  centralized  systems.  Chapter  3  describes  the  theoretical  exten¬ 
sions  necessary  for  applying  GRMS  to  a  distributed  system.  Chapter  4  introduces  the  no¬ 
tion  of  scheduling  abstractions,  a  technique  for  using  and  analyzing  existing  subsystems 
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that  are  not  designed  to  support  GRMS.  This  is  a  very  important  aspect  in  the  application  ol 
this  theory  to  real-wcr'd  systems.  Chapter  5  describes  a  comprehensive  example  that  il¬ 
lustrates  task  scheming  within  subsystems  as  well  as  end-to-end  scheduling  in  a  large  real¬ 
time  system.  F^iily  we  make  concluding  remarks  in  Chapter  6. 
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2.  Synopsis:  GRMS  in  Centralized  Systems 

A  real-time  system  typically  consists  of  both  periodic  and  aperiodic  tasks.  A  periodic  task  t, 
is  characterized  by  a  worst  case  computation  time  C,  and  a  period  T,.  Unless  mentioned 
otherwise  we  assume  that  a  periodic  task  must  finish  by  the  end  of  its  period.  Tasks  are 
independent  if  they  do  not  need  to  synchronize  with  each  other.  By  using  either  a  simple 
polling  procedure  or  a  more  advanced  technique  such  as  a  sporadic  server  [9],  the  schedul¬ 
ing  of  aperiodic  tasks  can  be  treated  within  the  rate  monotonic  framework.  In  each  case  C 
units  of  computation  are  allocated  in  a  period  of  T  for  aperiodic  activity.  However,  the  man¬ 
agement  and  replenishment  of  the  capacity  is  different  in  each  case. 


The  scheduling  of  independent  periodic  tasks  was  originally  considered  by  Liu  and  Layland 
[4].  The  scheduling  of  periodic  tasks  with  synchronization  requirements  can  be  addressed 
by  a  simple  extension  to  the  original  formula  as  follows  [6]: 


Theorem  1:  A  set  of  n  periodic  tasks  scheduled  by  the  rate  monotonic  algorithm 
will  always  meet  its  deadlines,  for  all  task  phasings,  if 


V  i,  \<i<n. 


C,  C2  (C.+S.) 

+  +._L.  i  <f(2^-l) 


h 


where  Bt  is  the  duration  in  whicn  task  tj  is  blocked  by  lower-priority  tasks.  This  blocking 
which  is  also  known  as  priority  inversion.  The  effect  of  this  blocking  can  be  modeled  as 
though  task  -tj’s  utilization  increased  by  an  amount  fi/T,.  Theorem  1  shows  that  the  dura¬ 
tion  of  priority  inversion  reduces  schedulability,  the  degree  of  processor  utilization  at  or  be¬ 
low  which  all  deadlines  can  be  met.  Priority  inversion  in  a  centralized  system  can  occur 
when  tasks  have  to  synchronize  and  have  common  critical  sections.  This  inversion  can  be 
controlled  by  a  priority  ceiling  protocol.  The  priority  ceiling  protocol  is  a  real-time 
synchronization  protocol  described  in  detail  in  Sha,  Rajkumar,  and  Lehoczky  [6].  Under  this 
protocol  there  are  no  mutual  deadlocks,  and  a  higher-priority  task  can  be  blocked  by  lower- 
priority  tasks  only  once.  As  a  result,  the  worst  case  duration  of  blocking  is  the  longest  critical 
section  that  may  block  a  task. 

A  tutorial  on  GRMS  that  contains  further  details  on  these  issues  is  given  in  Sha  and 
Goodenough  [5]. 
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3.  Distributed  System  Extensions  for  GRMS 

Scheduling  in  a  network  is  different  from  scheduling  in  a  centralized  environment.  In  a  cen¬ 
tralized  system,  the  centralized  scheduler  immediately  knows  of  all  resource  requests.  In 
some  networks,  distributed  scheduling  decisions  must  be  made  with  incomplete  information. 
From  the  perspective  of  any  particular  station,  some  requests  could  be  delayed  and  some 
may  never  be  seen,  depending  on  the  relative  position  of  the  station  in  the  network.  The 
challenge  is  to  achieve  predictability  under  these  circumstances. 

GRMS  theory  has  to  be  extended  to  address  this  challenge.  Certain  basic  concepts  such  as 
schedulability  and  preemption  need  to  be  revisited,  and  some  concepts  such  as  system  con¬ 
sistency  need  to  be  developed. 


3.1.  Extensions  to  Schedulability  Concept 

In  a  real-time  system  a  particular  activity  is  said  to  have  "met  its  deadline*  if  the  activity 
completes  by  its  deadline.  When  scheduling  tasks  on  a  processor,  each  task  is  said  to  have 
met  its  deadline  if  it  completes  execution  by  a  certain  time  before  the  end  of  its  period.  In  a 
communication  network,  the  delay  incurred  by  a  message  in  reaching  its  destination  is  the 
sum  of  the  transmission  delay  and  the  propagation  delay.  The  transmission  delay  is  the 
time  between  message  arrival  at  a  station  and  the  time  at  which  it  is  transmitted.  The  trans¬ 
mission  delay  can  be  treated  analogously  to  task  execution  on  a  processor.  However,  the 
propagation  delay  can  be  longer  than  packet  transmission  times  causing  the  transmission  of 
the  next  message  to  begin  before  a  particular  message  reaches  its  destination.  This  occurs 
in  networks  such  as  FDD!,  IEEE  802.6  distributed-queue  dual-bus  (DQDB),  and  even  IEEE 
802.5  token  rings  when  early  token  release  is  used.  It  is  therefore  useful  to  separate  trans¬ 
mission  delay  and  propagation  delay  and  consider  the  notion  of  transmission  schedulability 
[7].  A  set  of  messages  is  said  to  be  transmission  schedulable  (t-schedulable)  if  each  mes¬ 
sage  can  be  transmitted  before  its  deadline.  Satisfaction  of  the  end-to-end  deadline  of  the 
message  can  be  found  using  the  relation: 

End-to~End  Deadline  >  Transmission  Deadline  +  PropagationDelay 

For  example,  in  an  FDDl  network,  the  worst  case  propagation  delay  is  the  walk  time,  defined 
as  the  time  taken  by  a  single  bit  to  traverse  the  ring  if  no  station  on  the  ring  wanted  to 
transmit. 


3.2.  Preemption  Control 

From  the  user's  viewpoint,  a  certain  initial  delay  in  setting  up  a  periodic  connection  is  ac¬ 
ceptable.  However,  users  expect  a  steady  flow  of  information  once  the  connection  is  set  up, 
and  hence  require  that  C  packets  be  delivered  every  period  T.  We  will  discuss  the  need  for 
preemption  control  to  achieve  the  above  property. 
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Preemption  is  the  most  basic  concept  in  priority  scheduling.  Tasks  are  assigned  priorities 
according  to  some  algorithm  to  maximize  resource  (e.g.,  processor)  utilization.  It  has  been  a 
long  held  belief  that  the  idealized  form  of  priority  scheduling  is  to  achieve  instantaneous 
preemption,  i.e.,  whenever  a  high-priority  task  becomes  ready,  the  resource  is  immediately 
taken  away  from  any  lower-priority  task  and  given  to  the  high-priority  task. 

It  has  been  assumed  that  increasing  preemptability  always  leads  to  a  minimization  of  priority 
inversion,  and  that  priority  inversion  is  eliminated  if  a  higher-priority  task  can  always  preempt 
a  lower  priority  task.  However,  this  is  not  true  in  a  distributed  system.  In  a  distributed  sys¬ 
tem  there  can  be  special  situations  when  a  particular  preemption  increases  the  delay  experi¬ 
enced  by  lower-priority  connections,  but  does  not  reduce  the  worst  case  duration  of  priority 
inversion.  We  call  such  situations  over-preemption,  and  their  effect  is  to  reduce  the  schedul- 
able  utilization  of  the  network.  To  overcome  the  undesirable  effect  of  over-preemption,  a 
preemption  control  protocol  is  needed.  In  the  following,  we  use  a  dual-link  network  based  on 
the  IEEE  802.6  DQDB  [8]  as  an  example  to  introduce  the  two  aspects  of  our  preemption 
control  protocol,  namely  phase  control  and  rate  control.  Finally,  we  will  address  the  logical 
relation  between  preemption  control  and  the  priority  inversion. 


IEEE  802.6  DQDB  Operation 

The  IEEE  802.6  DQDB  MAC  [I]  protocol  specifies  a  pair  of  slotted  links  operating  in  opposite 
directions.  The  links  may  be  referred  to  as  Flink  and  RlinJc  respectively  as  shown  in  Figure  3- 1 . 


Figure  3-1 :  IEEE  802.6  DQDB  Network 

Fixed-length  slots  are  generated  by  slot  generators  of  the  corresponding  links.  Although  the  figure 
shows  slot  generators  as  separate  functional  units,  the  slot  generation  function  can  be  embedded  in 
stati  jns  at  the  end  of  the  links.  Each  station  is  able  to  transmit  and  receive  messages  on  both  links. 
The  selection  of  the  link  to  be  used  for  transmission  depends  on  the  physical  location  of  the  destina¬ 
tion.  Reservation  for  a  slot  on  the  Flink  is  made  on  the  Rlink  via  a  request  and  vice  versa. 

The  operation  of  the  protocol  is  based  on  a  single  busy  bit,  indicating  whether  the  slot  is  used  or  free, 
and  a  request  bit  per  slot  for  each  priority  level.  Four  priority  levels  arc  supported.  Each  priority  level 
represents  a  separate  access  queue.  A  station  wishing  to  transmit  at  a  certain  priority  on  Rink,  issues  a 
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request  in  a  slot  on  Rlink  by  setting  the  proper  request  bit.  It  also  places  its  own  request  into  its  access 
queue  at  the  correct  priority.  Each  station  on  seeing  a  request,  enqueues  it  in  its  access  queue  at  the 
correct  priority.  Every  station  on  seeing  a  free  slot  discards  the  top  request  from  its  highest  priority 
non-empty  access  queue,  because  the  slot  has  been  previously  reserved  by  another  station.  If  the  top 
request  is  the  station’s  request  then  it  transmits  in  the  slot  on  the  Flink  in  addition  to  removing  the 
request  from  its  access  queue.  The  access  queues  are  implemented  using  a  set  of  two  counters  for  each 
priority  level.  Details  may  be  found  in  [1]. 

The  current  IEEE  802.6  protocol  does  not  have  adequate  mechanisms  to  ensure  correct  operation  in  a 
real-time  distributed  scheduling  environment  (3).  As  a  result,  it  exhibits  unpredictable  behavior  under 
certain  conditions  [2,3].  The  distributed  scheduling  extension  to  GRMS  reported  in  this  paper  has 
been  the  result  of  studying  these  problems.  Finally,  IEEE  802.6  implements  only  4  priority  levels.  As 
we  will  see  in  Chapter  4,  this  also  results  in  a  reduction  of  schedulability.  However,  IEEE  802.6 
provides  high  bandwidth  over  a  long  distance  and  the  problems  mentioned  here  can  be  solved  by 
extensions  to  the  standard  [3]  or  by  the  development  appropriate  scheduling  abstractions. 
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Example  1 :  Consider  a  dual-link  network  with  three  stations,  as  shown  in  Figure  3-2. 
Let  the  delay  between  Sj  and  S2  be  10  slots  and  between  S2  and  S3  be  1  slot  as 
shown.  Let  Sj  and  S3  be  transmitting  as  follows:  Sj  has  a  low-priority  connection  that 
uses  100  slots  every  10000  slots.  S3  has  a  medium  priority  connection  that  wants  to 
transmit  in  1  slot  every  10  slots.  This  leads  to  a  slot  usage  pattern  as  shown.  Slots 
labeled  L  are  used  by  Slt  and  the  slot  labeled  M  is  used  by  S3.  Notice  that  5j  has 
released  an  empty  slot  so  that  S3  may  transmit  once  every  1 0  slots  are  it  requires. 
Now  let  S2  start  a  new  high-priority  connection  that  needs  to  transmit  in  1  slot  every  4 
slots.  Since  S2  s  request  has  higher  priority,  it  preempts  S3’s  request  in  S2  s  queue 
and  S2  will  transmit  in  the  unused  slots  that  were  meant  for  S3.  The  first  of  the  slots 
released  by  for  S2  will  take  20  units  of  time  after  S2  s  first  request  to  reach  S2. 
Until  this  time  since  S2  can  only  transmit  in  slots  meant  for  S3,  S2  can  transmit  only 
one  slot  in  10  which  is  less  than  it  needs.  As  a  result,  even  though  S3’s  connection  is 
interrupted,  S2  is  not  t-schedulable,  resulting  in  an  erratic  connection.  Therefore  the 
preemption  of  53's  request  is  a  form  of  over-preemption. 

To  correct  the  problem  in  the  above  example  we  need  to  prevent  Station  S2  from  using  slots 
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Priority  Priority 

Connection  Priority  L  H 1/4  Ml/10  slots 

100/10000  slots 


Figure  3-2:  Preemption  Control  Example 

released  for  station  S3.  This  means  that  S2  should  delay  its  slot  use  for  20  slot  times  after  its 
first  request,  which  is  the  round  trip  delay  between  S2  and  the  slot  generator  for  the  Flink. 
After  this  time,  slots  released  by  S1  in  response  to  S2’s  request  will  reach  S2 ■  This  is  tne 
phase  control  aspect  of  preemption  control. 

However,  phase  control  by  itself  is  insufficient.  During  the  20  unit  delay  5  cells  are  buffered 
at  S2.  After  the  20  unit  delay,  only  1  slot  in  4  will  be  released  for  use  by  S2.  Hence  S2  at¬ 
tempts  to  transmit  all  5  slots  as  soon  as  possible,  then  connection  from  S3  will  again  be 
disrupted  without  improving  S2  s  worst  case  end-to-end  latency.  Observe  that  the  20  unit 
delay  will  add  to  S2  s  worst-case  delay  irrecoverably.  After  the  connection  is  set-up,  the 
destination  expects  1  cell  every  4  slots;  therefore,  attempting  transmission  of  all  5  cells  as 
soon  as  possible  does  not  improve  S2’s  worst-case  performance.  Hence,  S2  should  only 
transmit  one  slot  every  4,  after  the  round-trip  delay  of  20  slot  times.  This  is  the  rate  control 
aspect  of  preemption  control.  With  phase  and  rate  control,  the  connections  for  both  S2  and 
S3’s  will  be  transmission  schedulable. 

Finally,  we  want  to  point  out  that  from  an  implementation  viewpoint,  the  preemption  control 
occurs  at  a  higher  layer  than  the  priority  queueing  mechanism.  Prioritized  packets  released 
by  preemption  control  protocol  into  the  MAC  layer  will  follow  usual  priority  queueing  rules  as 
in  a  centralized  system. 


3.3.  System  Consistency 

As  discussed  before,  stations  in  a  distributed  system  may  have  incomplete  or  delayed  infor¬ 
mation  of  the  system  state.  This  may  lead  to  inconsistent  views  of  the  system  state,  as  il¬ 
lustrated  by  the  following  example: 

Example  2:  Consider  three  stations  Sc,  Sb,  and  Sa,  in  a  dual-link  network  as  shown  in 
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Figure  3-3.  Suppose  Sb  enters  its  own  request  Rb  in  its  transmission  queue  and  then 
attempts  to  make  a  request  on  the  Riink.  Let  Sb  be  prevented  from  making  a  request 
on  Riink  by  higher-priority  requests  until  request  Ra  by  station  Sa  passes  by.  On  the 
request  stream,  Ra  precedes  Rb  while  in  Sb s  transmission  queue  Rb  precedes  Ra. 
After  the  requests  are  registered  in  station  Sc,  the  transmission  queue  of  Sc  will  have 
Ra  preceding  Rb,  which  is  inconsistent  with  the  queue  of  station  Sb. 


Figure  3-3:  Inconsistent  Station  Queues  in  an  IEEE  802.6  DQDB  Network 

To  address  this  problem  we  introduce  the  concept  of  system  consistency.  System  consis¬ 
tency  can  be  defined  as  follows:  In  a  distributed  system  it  is  possible  for  two  request  entries 
to  exist  in  multiple  queues.  For  example,  two  requests  can  simultaneously  exist  in  multiple 
station  queues  in  a  dual-link  network  [1].  A  system  is  said  to  be  consistent  if  and  only  if  the 
order  of  the  same  entries  in  different  station  queues  is  consistent  with  each  other.  For  ex¬ 
ample  in  a  dual-link  network,  if  request  Ri  and  request  R2  both  exist  in  queue  Qa  and  queue 
Qb,  and  ifR1  is  ahead  of  R2  in  Qa,  then  R{  must  also  be  ahead  of  R2  in  Qb. 

The  inconsistency  problem  can  lead  to  conflicts  between  distributed  scheduling  actions.  In¬ 
consistency  can  be  avoided  by  the  following  rule:  A  station  is  not  permitted  to  enter  its  re¬ 
quest  in  its  own  queue  until  it  has  successfully  made  the  request  on  the  link.  This  makes  the 
entries  in  each  queue  consistent  with  the  ordering  of  requests  on  the  link.  Therefore  all  the 
queues  will  be  consistent  with  each  other.  In  the  above  example,  station  Sb  cannot  enter  its 
request  in  its  queue  until  it  can  make  a  request  on  the  link.  Therefore,  Sb  s  request  will  be 
after  Sa's  request,  both  on  the  link  and  in  Sb  s  queue. 
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In  the  preceding  paragraphs  we  have  highlighted  fundamental  new  issues  in  distributed  real¬ 
time  system  design.  These  issues  are  intrinsic  to  a  wide  area  network  where  communication 
delays  are  long  and  scheduling  has  to  be  carried  out  in  parallel  by  distributed  stations  with 
partial  or  delayed  information.  Any  distributed  real-time  system  protocol  must  address  at 
least  some  of  these  issues. 

A  formal  description  of  the  above  concepts  is  described  as  a  coherent  reservation  protocol 
(CRP)  and  a  preemption-control  protocol  [8].  The  important  theoretical  result  of  this  work  is 
summarized  by  the  following  theorem: 

Theorem  1 :  For  a  given  a  set  of  periodic  connections  in  a  dual-link  network  that 
follows  CRP,  if  the  set  of  connections  is  schedulable  in  a  centralized  preemptive 
priority-driven  system  with  zero  (negligible)  propagation  delay,  then  the  set  of  con¬ 
nections  is  transmission  schedulable  in  a  dual-link  network. 

The  importance  of  this  theorem  is  that  even  in  a  wide  area  network  with  incomplete  infor¬ 
mation,  scheduling  decisions  can  be  made  as  though  it  is  a  centralized  system.  This  allows 
us  to  seamlessly  use  GRMS  in  the  analysis  of  such  systems. 
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4.  Scheduling  Abstractions 

GRMS  assumes  preemptive,  priority  scheduling.  Ideally,  each  distinct  period  corresponds  to 
a  distinct  priority  level.  However,  only  a  limited  number  of  priorities  can  be  supported  by 
hardware.  The  effect  of  limited  priority  levels  is  reduced  schedulability  as  illustrated  by  the 
following  diagram  [7]: 


1.0 


n 

cs 


x> 

© 

JZ 

o 

CO 


© 

> 

co 

© 


0.8 

0.6 

0.4 

0.2 


Number  of  priority  bits 


Figure  4-1 :  Schedulability  Loss  vs.  The  Number  of  Priority  Bits 

Figure  4-1  plots  the  schedulability  as  a  function  of  priority  bits,  relative  to  the  schedulability 
with  as  many  priority  levels  as  needed  under  the  condition  that  the  ratio  between  the  largest 
and  the  shortest  period  is  100,000  [7].  As  can  be  seen,  the  schedulability  loss  is  negligible 
with  8  encoded  priority  bits,  which  corresponds  to  256  priority  levels.  In  other  words,  the 
worst-case  schedulability  obtained  with  8  priority  bits  is  close  to  that  obtained  with  an  un¬ 
limited  number  of  priority  levels. 

To  use  GRMS  for  the  development  of  real-time  computing  systems,  we  would  like  to  use 
subsystems  that  support  the  use  of  GRMS  such  as  Futurebus+,  POSIX.4  and  Ada  9x.  How¬ 
ever,  we  may  have  to  use  some  components  that  do  not  support  GRMS.  In  this  case,  we 
need  to  develop  a  scheduling  abstraction  for  the  sub-system  so  that  it  can  be  treated  as  if  it 
supports  GRMS.  Although  the  scheduling  abstraction  allows  the  use  of  GRMS,  it  comes  at 
cost  of  reduced  schedulability  due  to  the  lack  of  direct  support.  With  scheduling  abstrac¬ 
tions,  we  can  provide  application  developers  a  consistent  scheduling  interface  that  allows 
them  to  develop  applications  as  if  every  sub-system  supports  GRMS.  In  the  following,  we 
demonstrate  the  creation  of  scheduling  abstractions  by  using  the  FDDI  timed  token  protocol 
as  an  example. 
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Fiber  Distributed  Data  interface 

FDDI  is  a  100  Mbit/s  Local/Metropolitan  Area  Network  that  has  gained  recent  popularity.  FDDI  is  a 
token  ring  protocol  that  uses  a  timed-token  access  method  [1].  In  a  token  rotation  media  access 
protocol,  stations  are  connected  to  form  a  ring.  All  messages  move  around  the  ring  and  are  repeated 
by  each  station  through  which  they  pass.  A  station  reading  its  own  address  as  the  destination  copies 
the  packet  and  then  passes  the  packet  to  the  next  station  in  the  ring.  Once  the  frame  reaches  the  source 
station,  it  is  removed  from  the  ring.  The  permission  to  transmit  is  granted  to  a  station  that  is  in 
possession  of  a  special  type  of  frame  called  a  token.  The  time  for  a  token  to  traverse  an  idle  ring  is 
called  the  walk  time ,  denoted  here  as  Wj. 

Under  this  protocol,  stations  on  the  network  choose  a  target  token  rotation  time  (TTRT).  A  station  in 
the  FDDI  protocol  can  transmit  in  either  synchronous  or  asynchronous  mode.  Each  station  is  allocated 
a  synchronous  capacity,  which  is  the  maximum  time  a  station  is  permitted  to  transmit  in  synchronous 
mode  every  time  it  receives  the  token.  Synchronous  capacities  of  each  station  are  restricted  to  a 
pie-allocated  fraction  of  (TTRT-Wj),  such  that  the  cumulative  synchronous  capacity  of  the  entire 
network  is  bounded  by  (TTRT-Wj).  When  a  station  receives  a  token,  it  first  transmits  its  synchro¬ 
nous  traffic  for  an  amount  of  time  bounded  by  its  synchronous  capacity.  Then  it  may  transmit 
asynchronous  traffic  only  if  the  the  time  since  the  previous  token  departure  from  the  same  station  is 
less  than  TTRT.  This  protocol  forces  the  token  to  rotate  at  a  speed  such  that  the  time  between  two 
consecutive  token  visits  is  bounded  by  2*TTRT  [2].  In  a  network  that  uses  only  synchronous  mode, 
time  between  consecutive  token  arrivals  is  bounded  by  one  TTRT. 

References 

1.  FDDI  Token  Ring  Media  Access  Control  --  ANSI  Standard  X3T9.5/83-16, 1987. 

2.  Sevcik,  K.  C.  and  Johnson,  M.  J.  "Cycle  Time  Properties  of  the  FDDI  Token  Ring  Protocol", 
IEEE  Transactions  on  Software  Engineering,  SE-13,  No.  3, 1987,  pp  376-385. 


A  real-time  scheduling  analysis  of  FDDI  for  the  case  of  one  periodic  connection  per  station 
has  been  developed  [2],  In  this  paper,  using  the  normalized  proportional  allocation  scheme 
in  Agrawal,  Chen,  Zhao,  and  Davari  [2],  we  create  a  scheduling  abstraction  when  there  are 
more  than  one  periodic  connections  per  station.  In  the  development  of  ihis  abstraction,  we 
need  to  consider  priority  granularity,  priority  inversion,  system  consistency,  pre-emption  con¬ 
trol,  and  transmission  schedulability. 

In  an  FDDI  network  that  uses  only  synchronous  mode,  each  station  Si  can  transmit  once 
every  TTRT  for  an  amount  equal  to  an  assigned  synchronous  capacity  Therefore  the 
resource  (network)  is  allocated  to  stations  in  a  time-division  multiplexed  fashion,  with  no  pri¬ 
ority  between  stations.  As  an  example,  Figure  4-2  shows  the  transmission  sequence  from 
three  stations  5j,  S2,  and  S3,  allocated  bandwidths  of  H j,  H2,  and  //3  respectively.  Hence  it 
may  appear  from  Figure  4-1  that  the  schedulable  utilization  is  zero.  However,  the  order  of 
message  transmissions  from  each  station  may  be  prioritized.  If  each  station  implements 
sufficient  priorities  to  use  its  dedicated  portion  of  the  bandwidth,  then  there  is  no 
schedulability  loss  within  stations.  However,  since  there  is  no  priority  arbitration  between 
stations,  a  station  with  the  token  can  transmit  lower-priority  messages  even  when  high- 
priority  messages  are  waiting.  In  this  sense,  it  is  a  bounded  priority  inversion  and  limits  the 
schedulable  utilization  of  the  network. 
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Figure  4-2:  Transmission  sequence  from  three  stations  in  synchronous  mode 


A  message  must  satisfy  these  conditions  to  be  schedulable  in  an  FDD!  network  that 
operates  in  synchronous  mode: 

•  Each  connection’s  period  Tt  must  satisfy  the  relation  T>TTRT. 

•  Each  station  S,.  must  be  allocated  enough  synchronous  capacity  Hi  so  that  each 
connection  in  the  station  is  t-schedulable. 

A  simple  scheme  for  synchronous  bandwidth  allocation  is  the  normalized  proportional 
scheme  suggested  by  Agrawal,  Chen,  Zhao,  and  Davari  [2].  The  total  available  bandwidth 
on  each  token  rotation  is  given  by  ( TTRT-Wr ).  The  normalized  proportional  allocation 
scheme  gives  each  station  a  fraction  of  this  bandwidth,  consistent  with  that  station’s  con¬ 
tribution  to  the  total  network  utilization.  Therefore,  the  bandwidth  Hi  allocated  to  station  Si  is 
given  by: 

ui 

Ht  =  -jj  (JTRT-Wj) 


Ut  is  the  network  bandwidth  utilized  by  station  St  and  U=UX+  .  .  .  +Un.  TTRT  is  the  target 
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token  rotation  time  and  WT  is  the  walk  time.  The  following  example  demonstrates  this  for¬ 
mula: 


example  3:  Suppose  we  have  three  periodic  messages  xv  x2  and  x3,  to  be  trans¬ 
mitted  from  three  stations  Sv  S2,  and  S3  respectively,  on  an  FDDI  network.  Let  the 
TTRT  be  8  and  the  walk  time  Wrbe  1. 

•  Message t.,:  Cx  =  7\TX  =  100 

•  Message  x2:  C2  - 10;  T2  -  145 

•  Message  x2:  C3  =  15;  T2  =  150 

where  C,  is  the  transmission  time  and  T,  is  the  period  of  message  Xj.  The  utilization 
of  the  above  message  set,  U  =  0.239.  Applying  the  above  formula,  H x  =  2.05,  H2  = 
2.02,  //3  =  2.93. 

Consider  the  application  of  GRMS  to  an  FDDI  network  that  transmits  only  in  synchronous 
mode.  Let  synchronous  bandwidths  be  allocated  to  stations  by  some  technique  such  as  the 
above  formula.  Notice  that  if  each  station  uses  its  allocated  synchronous  bandwidth,  the 
actual  token  rotation  time  (TRT)  (time  between  consecutive  token  arrivals  at  the  station)  at¬ 
tains  its  maximum  possible  value  TTRT.  However  if  any  station  does  not  use  its  allotted 
bandwidth,  then  the  token  rotates  faster  than  TTRT. 

Consider  any  station  St  in  the  network.  Let  the  capacity  allocated  to  the  station  be  Hv  Let 

the  station  be  a  source  of  periodic  messages  xu  =  (CU,TU),  x2i~(C2i,T2i) . 

xni  =  (Cm,Tw).  The  station  can  transmit  for  up  to  Hi  units  of  time,  every  TTRT. 

GRMS  can  be  applied  to  scheduling  messages  in  this  station  as  follows:  In  addition  to  the 
message  set  that  must  be  transmitted,  the  station  can  be  considered  to  have  another 
"message"  to  transmit.  The  period  of  this  message  is  the  actual  token  rotation  time  TRT. 
The  "transmission  time"  of  this  message  is  given  by  ( JRT-H ,).  We  refer  to  this  task  as 
Token  Rotation  Message  xtr  The  token  rotation  message  represents  the  time  that  the  sta¬ 
tion  is  prevented  from  transmitting  every  token  rotation.  The  longest  period  of  the  token 
rotation  task  is  TTRT.  A  necessary  condition  for  schedulability  is  that  the  period  of  the 
highest  frequency  message  must  be  longer  than  TTRT.  Note  that  the  actual  token  rotation 
time  can  be  shorter  than  TTRT  if  other  stations  do  not  completely  use  their  synchronous 
allocations.  However,  station  St  is  guaranteed  Hi  amount  of  bandwidth  in  every  token  rota¬ 
tion  cycle.  Hence,  if  connections  in  S,-  are  schedulable  in  the  longest  cycle  (TTRT),  they  are 
also  schedulable  in  any  shorter  cycle. 

The  FDDI  scheduling  abstraction  described  above  has  two  levels  of  scheduling.  At  the 
higher  level,  the  resource  capacity  is  allocated  between  applications  in  a  time  division  mul¬ 
tiplexed  (TDM)  manner.  Within  each  allocation,  the  resource  schedules  activities  using 
GRMS.  This  abstraction  can  be  directly  used  for  sharing  a  processor  between  real-time  and 
non-real-time  applications.  In  this  case,  we  create  a  cycle  and  allocate  portions  of  the  cycle 
to  real-time  and  non-real-time  activities  respectively.  Observe  that  similar  to  FDDI,  the  cycle 
has  to  be  no  greater  in  length  than  the  period  of  the  highest  frequency  real-time  task.  The 
TDM  switching  overhead  can  be  treated  similar  to  the  walk-time  in  the  FDDI  case. 
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Finally,  consistency  between  station  queues  is  not  an  issue  since  each  station  has  its  own 
dedicated  bandwidth.  Preemption  control  is  still  necessary  when  a  new  connection  has  to 
be  established  and  synchronous  bandwidth  has  to  be  reallocated.  In  this  case,  the  connec¬ 
tion  should  first  exercise  phase  control  and  avoid  transmitting  until  bandwidth  is  allocated  for 
the  new  connection.  Furthermore,  the  new  allocation  should  exercise  rate  control  and  not 
transmit  all  its  accumulated  packets.  Finally,  the  concept  of  transmission  schedulability  is 
directly  applicable  in  this  abstraction  as  will  be  discussed  in  Chapter  5. 
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5.  Example  Application 

We  have  reviewed  the  theory  for  applying  GRMS  to  a  distributed  system  and  developed  a 
scheduling  abstraction  for  FDDI,  the  only  component  in  our  system  that  does  not  directly 
support  GRMS.  This  example  illustrates  how  to  apply  GRMS  extensions  to  schedule  a  distri¬ 
buted  system  that  consists  of  both  real-time  control  activities  and  multimedia  communica¬ 
tion.  This  example  will  illustrate  the  following  concepts: 

•  Management  of  end-to-end  deadline  by  partitioning  into  subsystem  deadlines 

•  Sharing  of  a  processor  by  both  real-time  and  non-real-time  activities 

•  Application  of  the  FDDI  scheduling  abstraction 

•  Management  of  propagation  delay  and  jitter 


5.1.  Description  of  Example 

Consider  the  system  in  Figure  1-1.  It  is  built  around  an  FDDI  network.  Station  S5  is  a  mul¬ 
tiprocessor  built  around  a  Futurebus+  backplane.  The  control  processor  in  station  S5 
receives  a  variety  of  sensor,  audio  and  video  information,  from  both  local  and  remote 
sources.  There  exists  an  end-to-end  deadline  from  each  source  to  the  control  processor. 
The  end-to-end  deadline  is  the  permissible  delay  between  the  instant  the  information  is  cap¬ 
tured  at  the  sensor,  to  the  instant  the  system  outputs  its  response  to  the  information.  We 
assume  that  the  priority  ceiling  protocol  is  used  for  task  synchronization.  A  high-level  de¬ 
scription  of  the  data  flow  in  this  example  system  is  as  follows.  All  time-units  in  the  following 
example  are  milliseconds. 

Traffic  Across  Network: 

•  Station  Sy  Remote  sensor  information  is  captured  and  transmitted  ac  oss  the 
network  to  the  control  processor  in  Station  S5.  The  station  transmits  1 .0  Mbits  of 
data  every  150 .  Also  sends  5  Mbits  of  data  every  100  to  Stations  2  and  4. 

•  Station  Sy  Video  monitoring  station  captures  audio  and  video  information  and 
transmits  it  over  the  network  to  the  control  processor  in  Station  5.  The  required 
end-to-end  deadline  is  100  .  The  video  source  is  1024x768  pixels  per  frame  at 
24  bits/pixel  and  30  frames/sec.  Three  CD  quality  audio  channels  sampled  at 
44.1  Khz  with  32  bits/sample  are  also  transmitted. 

Workload  In  Station  S5 

•  Signal  Processor  Tasks :  The  local  sensor  takes  an  observation  every  40.  To 
reduce  unnecessary  bus  traffic  the  signal  processing  task  processes  the  signal 
and  averages  it  every  4  cycles  before  sending  it  to  the  tracking  processor. 

•  The  tracking  processor  tasks:  After  the  task  executes  it  sends  the  result  to  the 
control  processor  with  a  period  of  160.  Task  x3  on  the  control  processor  uses 
this  tracking  information.  In  addition,  the  end-to-end  latency  of  the  pipeline  of 
data  flow  from  the  sensor  to  the  control  processor  should  be  no  more  than  785. 

•  Control  Processor  Tasks:  The  control  processor  has  additional  periodic  and 
aperiodic  tasks  which  must  be  scheduled. 
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•  Aperiodic  event  handling  with  an  execution  time  of  5  and  an  average 
inter-arrival  time  of  100. 

•  A  periodic  task  for  handling  local  feedback  control  with  a  computation  re¬ 
quirement  and  a  given  period.  Task  x2:  C2  =  78;  T2  =  150; 

•  A  periodic  task  that  utilizes  the  tracking  information  received.  Task  Xo.’  C, 

=  30;  V  160; 

•  A  periodic  task  responsible  for  reporting  status  across  the  network  with  a 
given  computation  time  and  period.  Task  x4:  C4  =  10;  T4  =  300; 

•  In  addition  there  is  an  existing  non-real-time  application  which  requires 
9%  of  the  CPU  cycles  to  meet  its  performance  goals. 

5.1 .0.1.  Partitioning  of  End-to-End  Deadlines 

When  a  message  is  sent  within  a  station,  it  can  be  implemented  by  passing  a  message 
pointer  to  the  receiving  task  and  hence  can  be  treated  as  any  other  OS  overhead.  However, 
when  a  message  is  sent  outside  the  processor  boundary,  an  integrated  approach  to  assign¬ 
ing  message  and  task  deadlines  needs  to  be  developed.  Consider  the  situation  in  Figure 
1-1: 

•  The  sensor  takes  an  observation  every  40. 

•  The  signal  processing  task  processes  the  signal,  averages  the  result  every  4 
cycles,  and  sends  it  to  the  tracking  processor  every  160. 

•  The  tracking  processor  task  executes  with  a  period  of  160.  It  then  sends  a 
message  to  the  control  processor 

•  Task  x3  on  the  control  processor  that  uses  the  tracking  information  has  a  com¬ 
putational  requirement  of  30,  and  a  period  of  160  as  given  above.  Recall  that  in 
order  for  the  control  processor  to  respond  to  a  new  observation  by  the  sensor, 
the  end-to-end  laten  j  needs  to  be  less  than  785. 

A  guiding  principle  in  partitioning  the  deadline  is  to  try  and  minimize  the  impact  of  workload 
changes  in  a  subsystem  and  to  contain  the  impact  within  the  subsystem.  If  each  resource  is 
allowed  a  full  period  delay,  each  subsystem  can  be  analyzed  as  if  it  is  an  independent 
resource.  An  alternate  approach  is  to  determine  the  completion  time  at  each  resource  and 
the  end-end  delay  is  the  sum  of  the  completion  times.  This  approach  is  more  sensitive  to 
workload  changes. 

Finally,  when  a  task  is  scheduled  on  multiple  resources  in  series,  it  may  arrive  at  the  next 
resource  well  before  its  deadline  on  the  current  resource.  If  we  schedule  the  task  immedi¬ 
ately  upon  its  arrival,  it  will  create  the  jitter  problem  as  illustrated  below: 

Example  4:  Consider  two  resources  Rx  and  R2  connected  in  series.  Assume  task  t1 
has  a  period  of  1 0.  Furthermore,  x1  is  allocated  a  full  period  on  each  resource,  and  it 
uses  each  of  the  two  resources  for  5  units.  Let  task  x2  use  only  the  second  resource 
for  3  units,  with  a  period  of  12  units.  Let  the  first  instance  of  x1  arrive  at  R{  at  t  =  0, 
and  let  the  first  instance  of  x2  arrive  at  R2  at  t  =  1 0. 

Suppose  the  first  instance  of  x1  at  resource  completes  its  execution  and  arrives  at 
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/?2  at  t  =  10.  Since  x1  has  higher  priority  than  that  of  t2,  it  will  immediately  use  R2, 
preempting  t2.  Observe  that  the  second  instance  of  x,  arrives  at  R j  at  t  =  10.  Sup¬ 
pose  this  instance  is  not  delayed  at  Rv  Then  at  t  =  1 5,  the  second  instance  of  t-j  will 
begin  to  use  R2,  further  preempting  x2’s  starting  time  to  t  =  10.  As  a  result  x2  will 
miss  its  deadline  at  t  =  1 2. 

The  jitter  effect  can  be  easily  controlled  by  a  simple  rule:  a  task  becomes  ready  to  use  a 
resource  at  the  beginning  of  a  new  period.  Using  this  rule  in  the  above  example,  the  second 
instance  of  xt  will  be  buffered  and  will  become  ready  to  use  R2  only  at  t  =  20.  In  the  follow¬ 
ing  discussion  we  assume  that  this  rule  is  enforced.  It  should  be  noted  that  jitter  control  is  a 
special  case  of  the  phase  control  aspect  of  preemption  control. 

The  steps  involved  in  deadline  partitioning  are  as  follows:  First  we  try  to  use  the  rate 
monotonic  priority  assignment.  Since  rate  monotonic  analysis  guarantees  end-of-period 
deadlines,  we  assume  that  the  end-to-end  delay  is  the  sum  of  the  period  for  each  resource. 
Since  the  signal  processor  averages  four  cycles,  each  40  units  long,  its  delay  is  up  to  160. 
Each  of  the  other  resources  has  a  delay  up  to  one  period  which  is  1 60.  That  is,  the  total 
delay  using  rate  monotonic  scheduling  is  bound  by  4*40  +  160  +  160  +  160  +  160  =  800.  If  it 
were  less  than  the  allowable  delay  then  rate  monotonic  priority  assignment  could  be  used 
for  all  the  resources. 

However  the  specified  maximum  allowable  latency  is  785.  Hence  we  may  need  to  use 
deadline  monotonic  scheduling  for  at  least  some  of  the  resources  in  the  path.  From  a  soft¬ 
ware  engineering  viewpoint,  it  is  advisable  to  give  a  full  period  delay  for  global  resources 
such  as  the  bus  or  the  network  since  their  workload  is  more  susceptible  to  frequent 
changes.  Since  there  are  two  bus  transfers  involved  we  attempt  to  assign  a  full  period  to 
each.  We  also  attempt  to  assign  a  full  period  to  the  signal  and  tracking  processors.  Hence 
the  required  completion  time  of  the  control  processor  task  x3  should  be  no  greater  than 
785-4x(160)=  145.  We  therefore  assign  a  deadline  of  145  to  control  processor  task  x3. 


Deadline  Monotonic  Scheduling 

The  deadline  monotonic  scheduling  algorithm  is  a  generalization  of  the  rate  monotonic  scheduling 
algorithm.  Rate  monotonic  scheduling  assumes  that  the  deadline  of  a  periodic  task  is  at  the  end  of 
each  period.  That  is,  the  period  represents  the  window  of  time  within  which  a  task  must  initiate  and 
complete  its  execution.  Liu  and  Layiand  proved  that  it  is  optimal  to  give  tasks  with  narrower 
windows  higher  priorities.  They  referred  to  this  priority  assignment  method  as  the  rate  monotonic 
scheduling  algorithm  f2]. 

However,  a  task  may  have  its  deadline  before  the  end  of  its  period,  resulting  in  a  window  narrower 
\  than  its  period.  Leung  and  Whitehead  proved  that  it  is  still  optimal  to  assign  higher  priorities  to  tasks 

with  narrower  windows.  They  referred  to  this  priority  assignment  method  as  the  deadline  monotonic 
scheduling  algorithm  [1]. 
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5.1. 0.2.  Scheduling  Tasks  on  the  Control  Processor 

We  will  concentrate  on  scheduling  analysis  of  tasks  in  the  control  processor  using  the  com¬ 
pletion  time  test.  Scheduling  the  backplane  and  other  processors  is  similar.  First,  we  need 
to  create  two  virtual  processors  to  separate  the  real-time  and  non-real-time  applications.  We 
select  the  length  of  the  TDM  cycle  to  be  the  same  as  the  shortest  period  among  the  real- 
time  tasks,  1 00,  and  the  virtual  processor  switching  overhead  to  be  0.5.  Out  of  1 00,  9  will  be 
allocated  to  non-real-time  activities  and  90  to  real-time  virtual  processor,  with  one  unit  lost  in 
switching  overhead. 


Completion  Time  Test 

The  completion  time  test  is  a  faster  algorithm  to  test  the  schedulability  of  a  task  set  than  the  schedul¬ 
ing  point  test  that  is  usually  used.  It  is  based  on  the  critical  zone  theorem  [1]  which  states  that  if  a 
task  meets  its  first  deadline  even  when  all  higher-priority  tasks  become  ready  at  the  same  time  then  it 
can  meet  all  future  deadlines.  Consider  any  task  tn  with  a  period  rn,  deadline  D^Tn,  and  compu¬ 
tation  Cn.  Let  tasks  x,  to  xn_j  have  higher  priorities  than  x„.  Note  that  at  any  time  f,  the  total 
cumulative  demand  on  CPU  time  by  these  n  tasks  is: 


The  term  f  1/tA  represents  the  number  of  times  task  x-  arrives  in  time  t  and  therefore  t/T~\ 
represents  its  demand  in  time  t.  For  example,  let  7",  =  10,  Cj  =  5  and  /  =  9.  Task  Xj  demands  5  units  of 
execution  time.  When  t  =  11,  task  Xj  has  arrived  again  and  has  a  cumulative  demand  of  10  units  of 
execution.  Suppose  that  task  xn  completes  its  execution  exactly  at  time  t  before  its  deadline  £>n.  This 
means  that  the  total  cumulative  demand  from  the  n  tasks  up  to  time  /,  Wn(t),  is  exactly  equal  to  t,  that 
is,  lFn(t)  -t.A  technique  for  finding  this  time  is  given  in  Figure  5-1. 


Set  fg  Cj 

*1  W£t g); 

*2«-WW; 


tk  «-  W£tk_i); 

Stop  when  (W£t0  =  **) 

Figure  5-1 :  Finding  minimum  t,  where  Wj(t)  =  t 
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Example  5:  Consider  a  task  set  with  the  following  independent  periodic  tasks: 

•  Task  x,:  Cj  =  20;  Tj  =  100;  Dx  =  100; 

•  Task  ^2'  C2  =  90 ;T2-  145;  D2  =  145; 

Task  Xj  is  clearly  schedulable.  The  schedulability  of  ^  can  be  tested  as  follows: 

r0  =  C1  +  C2  =  20  +  90  =  1 10 

/1  =  lV2(f0)  =  2C1  +  C2  =  40  +  90=  130 

r2  =  W2(fj)  =  2Cj  +  C2  =  40  +  90=  130  =  fj  Hence  task  Xj  finishes  at  130  and  is 
schedulable. 
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Let  the  real-time  task  set  in  the  control  processor  execute  on  the  real-time  virtual  processor. 
The  effect  of  the  TDM  cycle  spent  in  non-real-time  and  overhead  processing  can  be 
modeled  as  a  high-priority  task  with  a  period  of  100  and  execution  of  10.  Consider  the  re¬ 
quirement  for  aperiodic  event  handling  with  an  execution  time  of  5  and  an  average  inter¬ 
arrival  time  of  100.  We  create  an  equivalent  sporadic  server  task  with  10  units  execution  and 
a  duration  of  100  which  has  the  highest  priority.  A  simple  approximate  analysis  consists  of 
two  parts. 

•  First,  the  aperiodic  arrives  during  the  real-time  virtual  processor  operation  with 
90%  probability.  Since  we  have  allocated  twice  the  average  required  band¬ 
width,  we  assume  that  the  probability  of  an  aperiodic  arriving  when  there  is  no 
server  capacity  is  negligibly  small.  Together  with  the  fact  that  the  aperiodic  task 
has  the  highest  priority,  we  can  use  a  a  simple  M/D/1  queueing  formula.  We 
have  the  following  result  [3]: 

.W  =  _B!L_  +  C=  5.132 
2(l-p) 

•  where  p  =  5/100,  the  utilization  by  the  aperiodic  task,  and  C  =  5. 

•  Secondly,  with  1 0%  probability  the  aperiodic  message  arrives  during  the  non- 
real-time  virtual  processor  operation.  Since  the  average  aperiodic  inter-arrival 
time  is  ten  times  longer  than  the  duration  of  the  non-real-time  virtual  processor, 
we  assume  that  at  most  one  aperiodic  message  can  arrive  when  the  virtual 
processor  is  executing.  In  this  case  the  aperiodic  message  must  wait,  on 
average,  for  half  the  duration  of  the  non-reai-time  processor  including  switching 
overhead.  In  this  case  the  response  time  of  the  aperiodic  message  is 
5+5.132=10.132. 

Finally,  considering  both  cases,  the  response  time  of  the  aperiodic  task  is 
0.9x5.132+0.1x10.132=5.632.  It  is  important  to  note  that  the  analysis  of  aperiodic  tasks  is 
generally  complex  and  may  require  simulation. 
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Since  the  TDM  cycle  and  sporadic  server  have  the  same  period,  they  may  be  considered  as 
a  single  task;  i.e.,  Task  x^  C-,  =  20;  T,  =  100.  Therefore,  tasks  on  the  control  processor  are 
as  shown  in  Table  5-1 : _ 


Task 

C 

T 

D 

20 

100 

100 

78 

150 

150 

30 

160 

145 

10 

300 

300 

Table  5-1 :  Tasks  on  the  Control  Processor 


Let  tasks  xv  x2,  and  x3  share  several  data  structures  guarded  by  semaphores.  Suppose  the 
duration  of  critical  sections  accessing  shared  data  structures  are  bounded  by  10  units.  Sup¬ 
pose  the  priority  ceiling  protocol  is  used.  Since  the  priority  ceiling  protocol  is  assumed  to  be 
implemented,  higher-priority  tasks  are  blocked  at  most  once  for  10  by  lower-priority  tasks. 

We  now  check  whether  or  not  x3  completes  within  145  under  rate  monotonic  priority  assign¬ 
ment.  Under  rate  monotonic  assignment,  Xj  and  x2  have  higher-priority  than  x3.  Hence,  the 
completion  of  x3  can  be  calculated  using  the  completion  time  test  as  follows: 

Iq  =  Cj  +  C2  +  C-j  =  20  +  78  +  30  =  128 

fj  =  ^3(^0) =  +  C2  +  C3  =  40  +  78  +  30  —  148 


W3(rj)  =  2  C,  +  C2  +  C3  =  148  =/, 


Therefore  the  completion  time  of  x3  is  148,  which  is  later  than  the  required  completion  time 
of  145.  In  order  to  meet  the  deadline  of  145  imposed  by  the  maximum  allowable  latency 
requirement  of  the  previous  section,  we  use  the  deadline  monotonic  priority  assignment. 
This  makes  task  x3’s  priority  higher  than  task  x2’s  priority,  since  x3  has  the  shorter  deadline. 

Under  this  priority  assignment,  the  schedulabiiity  of  each  task  can  be  checked  as  follows: 
Task  x1  can  be  blocked  by  lower-priority  tasks  for  10  ,  i.e.,  B1  =  10.  The  schedulabiiity  test 
for  task  xt  is  a  direct  application  of  Theorem  1 : 

C  B 

J.  +  J.  =  0.2  +  0.1=  0.3  <  l(21^1— 1)  =  1.0 

T\  M 

Therefore,  task  x1  is  schedulable.  Task  x3  is  the  second  highest  priority  task.  Since  x3  has  a 
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deadline  shorter  than  its  period,  the  schedulability  test  for  x3  can  be  checked  as  follows.  Let 
£3  =  (T3  -  D3).  In  the  schedulability  test  of  x3,  the  utilization  of  task  x2  does  not  appear, 
since  t2  has  a  lower-priority  and  does  not  preempt  x3.  Because  x2  has  a  lower  priority,  its 
critical  section  can  delay  x3  by  10  .  Therefore  B3  =  10. 

C  C  E  B 

_I  +  -1  +  J.  +  J.  =  0.2+0.188+0.094+0.0625  =  0.545  <2(21^2— 1)  =  0.828 
T\  r3  r3  r3 

Now  consider  the  third  highest  priority  task  x2.  From  the  view  point  of  the  rate  monotonic 
assignment,  the  deadline  monotonic  assignment  is  a  "priority  inversion".  Therefore  in  the 
schedulability  test  for  task  x2,  the  effect  of  blocking  has  to  include  x3's  execution  time.  The 
blocking  time  is  B2  =  C3+0.  The  zero  indicates  that  there  can  be  no  lower-priority  task  block¬ 
ing  x2: 

Ci  Co  -Bo 

—+—+—=  0.2+0.52+0.2  =  0.92  >  2(2I^-1)  =  0.828 
T\  T2  T2 

The  schedulability  test  of  Theorem  1  fails  for  x2.  The  schedulability  of  x4  can  be  checked  by 
the  following  simple  test  since  there  is  neither  blocking  or  deadline  before  its  end  of  period: 

c  c  c  c 

—  +  —  +  —  +  —  =  0.2+0.52+0.188+0.033=  0.941  >  4(2^— 1)  =  0.757 
Tl  T2  T3  TA 

Note  that  the  schedulability  test  of  Theorem  1  fails  for  both  tasks  x2  and  x4.  To  determine 
their  schedulability  we  use  the  completion  time  test.  Since  x1  and  x3  must  execute  at  least 
once  before  x2  can  begin  executing,  the  completion  time  of  x2  can  be  no  less  than  128: 

/q  =  C|  +  C2  +  B2  =  20  +  78  +  30  =  128 

However,  x1  is  initiated  one  additional  time  in  the  interval  (0,128).  Taking  this  additional 
execution  into  consideration,  W2(128)  =  148: 

*1  -  ^0)  *  2Cj  +  C2  +  B2  =  40  +  78  +  30  =  148 

Finally,  we  find  that  W2(148)=148  and  thus  the  minimum  time  at  which  W2(t)  =  t  is  148.  This 
is  the  completion  time  for  x2.  Therefore  x2  completes  its  first  execution  at  time  1 48  and 
meets  its  deadline  of  150: 

W2{tx)  =  2Cj  +  C2  +  B2  =  40  +  78  +  30  =  148 

Similarly  we  can  check  the  schedulability  of  task  x4  using  the  completion  time  test.  It  turns 
out  to  be  schedulable. 
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5.2.  Scheduling  Messages  on  FDDI 

The  messages  that  exist  on  the  FDDI  network  are  as  follows: 

Station  S 1:  Sensor  data  is  collected  and  stored.  Every  150,  the  station  transmits  1.0  Mbits  of 
data.  Let  the  station  also  transmit  5  Mbits  of  data  every  100  to  Stations  S2  and  S4. 

Station  3:  Video  information:  The  required  end-to-end  deadline  is  assumed  to  be  100.  As  an 
example  we  assume  a  video  source  of  1024x768  pixels  per  frame  with  24  bits/pixe!  at  30 
frames/second,  compressed  with  a  ratio  of  16:1.  There  also  exist  3  channels  of  CD  quality 
audio.  Each  channel  is  sampled  at  44.1  Khz  with  32  bits/sample.  The  end-to-end  deadline 
for  audio  is  also  100. 

Consider  scheduling  of  messages  at  Station  3.  We  need  to  partition  the  end-to-end  dead¬ 
lines  into  subsystem  deadlines.  The  resources  that  need  to  be  scheduled  along  the  path 
between  the  source  and  the  control  processor  are:  the  source  interface  processor,  the  net¬ 
work,  the  destination  network  interface,  the  backplane,  and  the  control  processor  itself.  As 
discussed  in  Chapter  5.1. 0.1,  the  simplest  way  to  partition  the  end-to-end  deadline  is  to  al¬ 
low  a  delay  up  to  a  period  on  each  resource. 

First  consider  the  video  task.  Its  natural  period  at  30  frames/sec  is  33.  If  we  spend  an  entire 
period  on  each  of  the  five  resources,  the  end-to-end  delay  will  exceed  the  limit  of  100. 
Hence,  we  transform  the  sending  period  to  60  Hz,  i.e.,  we  send  half  a  frame  every  16.5.  For 
the  resolution  given  above  this  works  out  to  no  more  than  6  of  transmission  time  every 
period  of  1 6.5. 

Now  consider  the  audio  task.  Its  natural  period  is  roughly  one  sample  every  22 
microseconds.  This  period  is  too  short  for  the  network  as  well  as  the  packetization  proc¬ 
essing.  Hence  we  transform  the  transmission  period  to  11;  that  is,  we  accumulate  500 
samples  every  1 1  units  for  each  of  the  three  sources.  This  bundling  results  in  efficient  net¬ 
work  utilization,  but  requires  the  destination  to  buffer  and  regulate  the  delivery  of  the  voice 
packets  at  the  source  frequency.  This  yields  no  more  than  0.5  of  transmission  time  every 
period.  The  end  to  end  delay  over  5  resources  will  be  no  more  than  55. 

Each  source  of  traffic  first  has  to  packetize  the  information.  The  schedulability  analysis  of 
the  tasks  running  on  the  source  network  interface  processor  is  simpler  than  the  analysis  of 
the  control  processor  tasks  since  there  is  no  complex  data-sharing  between  tasks.  Hence 
we  will  omit  the  analysis  and  assume  that  it  is  schedulabie. 

Let  the  TTRT  be  8  and  let  the  walk  time  WT  be  1.  The  approach  to  scheduling  traffic  on 
FDDI  is  as  follows.  Consider  the  scheduling  of  messages  in  a  particular  station  5,-  with  al¬ 
lotted  synchronous  bandwidth  H ,.  Therefore,  the  station  can  transmit  up  to  Hi  every  TTRT. 
As  discussed  in  Chapter  4,  this  can  be  treated  as  having  another  high-priority  task  with  mes¬ 
sage  transmission  time  (TTRT  -  //,•)  and  period  TTRT.  We  refer  to  this  task  as  Token  Rota¬ 
tion  task  ^  Using  this  framework,  schedulability  of  traffic  at  each  station  can  be  independ¬ 
ently  analyzed. 
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Let  us  apply  the  above  technique  to  messages  at  Station  3.  Let  H3  =  4,  then  x,r  * 
((^=8-4=4,  Tv  =  8).  The  message  set  for  Station  3  is  then: 

•  Token  Rotation  Message  x^:  Ctf3  =  4;  Ttr3  =  8 

•  Audio  Message  x13:  C13  =  0.5;  T13  =  1 1 

•  Video  Message  x23:  C23  =  6;  T23  =  1 6.5 

The  schedulability  of  this  message  set  can  be  checked  using  the  completion  time  test.  The 
token  rotation  message  is  obviously  schedulable.  Consider  completion  of  message  x^ 

r0  =  Cfr3  +  ^13  =  4  +  0.5  =  4.5 

'i  =  W13(x0)  =  C^  +  C13  =  4  +  0.5  =  4.5 

Hence  x13  is  schedulable.  Consider  completion  of  message  x23: 

Xq  =  Ctrl  "f"  C13  "f"  C23  =  4  +  0.5  +  6  =  10.5 

h  =  W23^0>  =  1Ctr3  +  Cn  =  8  +  0.5  +  6  =  14.5 

r2  — 1^23^1^ =  2^'tr3  "**  2Cj3  =  8  +  1.0  +  6  =  15.0 

r3  =  ^23(^2)  ~  2^tr3  2C13  =  ^  1-0  +  6  =  15.0  =f2 

Hence  x23  is  schedulable. 

Similarly,  we  can  test  the  schedulability  of  messages  at  Station  1 .  If  Station  1  is  allotted  a 
synchronous  bandwidth  of  Hx  =  3,  the  message  set  at  Station  1  can  be  written  as: 

•  Token  Rotation  Message  xtr1 :  =  5;  Ttr3  =  8; 

•  Cjj  =  10;  Tjj  =  100; 

•  x2-j :  C2j  =  15;  T2j  =  1 50; 

Note  that  this  message  set  is  also  schedulable.  The  utilization  of  the  network  is  60  %. 
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6.  Conclusion 


The  rate  monotonic  theory  and  its  generalizations  have  been  adoDted  by  national  high  tech¬ 
nology  projects  such  as  the  Space  Station,  and  have  recently  been  supported  by  major 
open  standards  such  as  the  IEEE  Futurebus+.  In  this  paper,  we  have  given  a  synopsis  of 
GRMS  for  centralized  systems.  Furthermore,  we  have  described  the  basic  concepts  in  the 
extensions  of  GRMS  for  distributed  system  scheduling,  namely  transmission  schedulability, 
system  scheduling  consistency,  and  preemption  control.  We  also  introduced  the  notion  of 
scheduling  abstractions  as  a  technique  for  analysis  of  systems  that  do  not  directly  support 
GRMS.  Finally,  we  have  provided  an  application  example  to  illustrate  the  assignment  of 
message  and  task  deadlines,  task  scheduling  and  message  scheduling. 
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